Efficient Speech Recognition System for Isolated Digits

نویسندگان

  • Santosh V. Chapaneri
  • Deepak J. Jayaswal
چکیده

In this paper, an efficient speech recognition system is proposed for speaker-independent isolated digits (0 to 9). Using the Weighted MFCC (WMFCC), low computational overhead is achieved since only 13 weighted MFCC coefficients are used. In order to capture the trends of the extracted features, the local and global features are computed using the Improved Features for Dynamic Time Warping (IFDTW) algorithm. In this work, we propose to reduce the time complexity of the recognition system by time-scale modification using a SOLA-based technique and also by using a faster implementation of IFDTW. The experiments based on TI-Digits corpus demonstrate the effectiveness of proposed system giving higher recognition accuracy of 99.16% and performing about 22 times faster than conventional techniques. Keywords-Speech Recognition; MFCC; Dynamic Time Warping, SOLA

برای دانلود رایگان متن کامل این مقاله و بیش از 32 میلیون مقاله دیگر ابتدا ثبت نام کنید

ثبت نام

اگر عضو سایت هستید لطفا وارد حساب کاربری خود شوید

منابع مشابه

Speech Recognition System of Arabic Digits based on A Telephony Arabic Corpus

Automatic recognition of spoken digits is one of the difficult tasks in the field of computer speech recognition. Spoken digits recognition process is required in many applications such as speech based telephone dialing, airline reservation, automatic directory to retrieve or send information, etc. These applications take numbers and alphabets as input. Arabic language is a Semitic language tha...

متن کامل

Development of a Real-time Asr System for Slovak Speechdat Database

This paper describes development of a real-time speech recognition system in Slovak for the voice-operated telephone services. The system is based on SPHINX2 platform. The decoder using Hidden Markov Models was trained on the SpeechDat-E Slovak database. It is speaker independent, large vocabulary, continuous speech real-time automatic speech recognition system. Test results are given for the t...

متن کامل

Continuous Digits Recognition Leveraging Invariant Structure

Recently, an invariant structure of speech was proposed, where the inevitable acoustic variations caused by non-linguistic factors are effectively removed from speech. The invariant structure was applied to isolated word recognition and the experimental results showed good performance. However, the previous method can’t apply to continuous speech recognition directly because there was no effici...

متن کامل

Isolated English Language Digit Recognition Using Hidden Markov Model Toolkit

The main purpose of the study was to develop a speech recognition system for isolated digits of English language using HTK. Speech, in addition to being a tool of communication, is also a symbol of identity and authorization. Two different corpora were collected of audio recordings of isolated digits of English language speakers, in which speakers read numeric digits. Both of the collected corp...

متن کامل

Using a Telephony Saudi Accented Arabic Corpus in Automatic Recognition of Spoken Arabic Digits

In this research, spoken Arabic digits are investigated from the speech recognition problem point of view. The system is designed to recognize an isolated whole-word speech. In the training and testing phase of this system, isolated digits data sets are taken from the telephony Arabic speech corpus, SAAVB. This standard corpus was developed by KACST and it is classified as a noisy speech databa...

متن کامل

ذخیره در منابع من


  با ذخیره ی این منبع در منابع من، دسترسی به آن را برای استفاده های بعدی آسان تر کنید

عنوان ژورنال:

دوره   شماره 

صفحات  -

تاریخ انتشار 2013